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ABSTRACT 

By utilising the inbuilt citation counts from NASA's astrophysics data system (ADS) 
I derive how many citations refereed articles receive as a function of time since publi- 
cation. After five years, one paper in a hundred has accumulated 91 or more citations, 
a figure which rises to 145 citations after ten years. By adding up the number of ci- 
tations active researchers have received over the past five years I have estimated their 
relative impact upon the field both for raw citations and citations weighted by the 
number of authors per paper. 
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1 INTRODUCTION 

What makes a good paper? No objective measure is 
ever going to be perfect but being cited in another pa- 
per at least indicates that the work has been noticed 
and thought to be worth mentioning. Papers with many 
citations are, in general, likely to be more useful and in- 
teresting than those that sink without a trace. This is 
much the same system as that used successfully by fast 
internet search engines to score sites so that they can 
provide a list ordered by usefulness; those sites which 
many people link to get a high score and appear near 
the top of the returned list. Such a system is admit- 
tedly far from perfect, as it can be influenced by many 
factors such as having a large number of friends who 
cite you, self-citing your own papers excessively or pro- 
ducing a very good paper that concludes an avenue of 
research in such a way that it doesn't form the basis 
for a large body of subsequent endeavour. That said, 
there is undoubtedly a trend; papers with high citation 
counts tend to be better written, more interesting and 
useful than those that never get referred to again. 

In section 2 I first look at citation classics, the 1000 
most cited astronomy papers according to the ADS. 
Next I examine the citation counts for refereed astron- 
omy papers published since 1970. From this the number 
of citations received by 1-in-lO, 1-in-lOO and 1-in-lOOO 
papers can be obtained, and are shown in table 1. The 
number of citations received by papers of a specified age 
is also shown, where the age stated defines the centre of 
a one year range (so 2 years ago means papers published 
between 18 and 30 months ago). 

When can a researcher be said to have a high ongo- 



ing impact on the field? This question is even harder to 
answer, but a sequence of well cited papers would be a 
good start. In section 3 I suggest two measures of cur- 
rent impact on the field which are calculated by tallying 
up a researchers total citations or normalised total ci- 
tations over a rolling five year interval. I calculate these 
numbers for a random sample of over 5000 astronomers 
with current publications in order to produce likelihood 
curves. 



2 CITATION CLASSICS 

Quite how a paper passes into folklore and becomes a 
citation classic is difficult to determine. To enter the 
ranks of the 1000 most cited astronomical papers in 
the ADS archive requires a paper to have obtained 257 
citations (as of november 2003, when the archive con- 
tained 439746 papers). For the purposes of this study, 
papers are defined to be those found in the astron- 
omy/planetary ADS archive, published in all refereed 
journals. Citations counts are those returned by ADS. It 
should be noted that these citation counts are not com- 
plete, with some references omitted because the citing 
journal is not within the ADS database or because an 
older article has only been scanned at present. Inaccu- 
racies in the citing papers reference list can also lead to 
missed citations. The ADS does contain complete ref- 
erence lists for all the major astrophysics journals back 
to issue 1. 

Figure 1 shows the distribution by publication year 
of the 1000 most cited astronomical papers. This peaks 
around 1985, a year which contributes nearly 50 papers 
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Table 1. The number of citations required to cross successive 
power-of-tcn likelihoods for papers published a specified number 
of years (plus six months) ago. The last four rows show the re- 
quired number of citations and normalised citations (see text) 
required per researcher to reach the same likelihoods. 



Figure 1. Histogram showing the publication year of the 1000 
most cited astronomical papers. As of november 2003 an individ- 
ual paper required 257 citations to make it onto this plot 

to the total. It takes around a decade for the number 
of citation classics per year to rise to over 30, a figure 
which tallies well with the 5 year timescale required 
to reach a maximum citation rate followed by a slow 
decline (Abt 1981). 

The oldest of these (Chandrasekhar 1943), 
'Stochastic Problems in Physics and Astronomy', is a 
true classic, with over 900 citations, whilst the second 
oldest (Bondi & Hoyle 1944), 'On the mechanism of 
accretion by stars' will most likely soon drop out of 
the list as it 'only' has 284 citations. Papers published 
this long ago are likely to have many contemporary ci- 
tations missed as they have not yet been entered into 
the archive. The most recently published additions to 
the list are Ahmad et al. (2001) - (for neutrino mea- 
surements) and Freedman et al. (2001) - (measuring the 
Hubble constant), with just over 300 citations each. 

Many of the most highly cited papers in this list 
are measurements of fundamental parameters - Kurucz 
1979, 'Model atmospheres for G, F, A, B, and O stars', 
Anders & Grevesse 1989, 'Abundances of the elements 
- Meteoritic and solar', Landolt 1992, 'UBVRI photo- 
metric standard stars in the magnitude range 11.5-16.0 
around the celestial equator'. Savage & Mathis, 1979 
'Observed properties of interstellar dust' and Draine & 
Lee, 1984, 'Optical properties of interstellar graphite 
and silicate grains', are all in the top 10 most cited pa- 
pers. 

How many citations do more normal papers receive? 
Using the counts in the ADS archive I have calculated 
the likelihood of a paper achieving a specified number 
of citations for all papers in the archive published since 
1970. I have also calculated these likelihoods for papers 
of a specified age, plus or minus six months. Nowadays 
there are roughly 15,000 papers published per year. 




0.0001 0.001 0.01 0.1 

Likelihood 



Figure 2. Likelihood of a paper obtaining a specified number 
of citations for all papers published since 1970 (bold line) and 
those published a specified number of years ago. The numbers of 
citations required to breach each likelihood decade are given in 
table 1. 



falling to 10,000 and 8,500 ten and twenty years ago. 
Figure 2 shows these likelihoods for a range of ages as 
well as the long term average (bold line) and the re- 
quired number of citations to cross certain thresholds 
are given in table 1. One-in-ten papers published five 
years ago has now received 26 citations, with one-in- 
100 getting more than 91 and one-in-1000 more than 
253 citations. 
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3 GOOD RESEARCHERS 

Defining a rating system for researchers is a subject 
fraught with danger as any single proposed scheme is 
bound to contain inconsistencies. Here I attempt to 
come up with a scheme that provides a general guide- 
line. An individual high score on either of the metrics 
should be subject to further examination and should 
only be treated as an indication - a researcher in the 
top few percent of both metrics is likely to be far more 
widely known than one in the bottom half. 

The basis of both schemes described here is to only 
consider papers published in refereed journals within 
a rolling five year time interval. This timespan is set 
to start five and a half years before the current time 
and finish six months ago (as there are almost no ci- 
tations in the first six months after publication). The 
citation score is then calculated by summing up the 
total number of citations for those papers up to the 
current date using the automatic facility built in to 
ADS. This procedure is achieved by going to the page; 
http://ukads.nottingham.ac.uk/abstract^ervice.html, 
typing a name in the author field and (in November 
2003) the dates 06/1998 and 05/2003 as the range of 
publication dates. The "select references from" field 
needs to be changed to "All refereed journals" and sort- 
ing done by citation count. This should return figures 
for the number of papers found and the total number 
of citations for those papers. There will also be an or- 
dered list of the papers, each with its own citation count, 
although this information is not used here. A second 
measure, the normalised citation score is calculated by 
weighting each paper by the number of authors, reduc- 
ing the impact of large collaborations which often pro- 
duce a large number of papers. No attempt is made 
to remove self citations, however gratuitous (Pearce et 
al. 2000). 

The difficult part of this study is obtaining a use- 
ful list of names to feed to the algorithm. I have taken 
all the unique names plus first initial contributing two 
or more items to the ADS for authors whose surname 
starts with A, B or C. In addition, I have also disabled 
synonym matching on author names, which avoids pat- 
tern matching on middle initials and phonetic pronun- 
ciation matching. This procedure will combine authors 
with names such as Martin, A. S. and Martin, A. C. to- 
gether, but such very similar names appear to be rare, 
at least when both are successful. I have checked that 
results near the top of the study suffer no detectable 
contamination. 

The final list of 18,346 names is automatically 
passed to the ADS server which returns citation and 
normalised citation figures as detailed above. Not all 
these astronomers have published during the five year 
period, as the source list of names spans the entire 
database. In total 5136, or roughly 30% of the au- 
thors have published two or more refereed papers in the 
study period. Figures 3 & 4 below show the likelihood 
of achieving different citation and normalised citation 
counts respectively, both for all the authors with two 
or more citations and for the 2467 active researchers 
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Figure 3. Likelihood of an autiior achieving more than a specified 
number of citations within a recent 5 year window. The lower 
curve is for all authors publishing 2 or more papers in the interval, 
the upper curve is for active researchers with 5 or more recent 
papers. 

who published at least one paper a year over the study 
period. 

As table 1 lists, ten percent of astronomers with two 
or more refereed papers in the last five years received 
more than 231 citations, with this figure rising to 382 
for active researchers. Similarly, one-in-ten publishing 
astronomers receive more than 40 normalised citations 
in the same period. 



4 CONCLUSIONS 

By extracting citation counts from the ADS I have pro- 
duced a list of the 1000 most cited astronomical papers 
and produced likelihoods for papers receiving a certain 
number of citations as a function of the number of years 
that have elapsed since publication. 

In this short work I have calculated two easily de- 
termined measures of success within the astronomical 
community. Although these numbers should be treated 
carefully they do at least provide some indication of 
the impact of a particular researcher on the field and 
relative to their contemporaries. The numbers derived 
here are simply reproducible for any given person and 
comparisons can be made against the averages for the 
community as a whole. 
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Figure 4. Likelihood of an author achieving more than a specified 
number of normaUsed citations within a recent 5 year window. 
The lower curve is for all authors publishing 2 or more papers in 
the interval, the upper curve is for active researchers with 5 or 
more recent papers. 
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